FIGURE 1 



Target segment starts with AA 

Target segment: 5' -> 3* 



GC Ratio Position 



siRNA Sense strand: 5' -> 3' 
siRNA Antisense strand: 3' -> 5* 



AAGATCCTCAAGAATGACGCC 0.48 1 1 7 

[SEQ ID NO: 18] 



AAGAATGACGCCTACCGCAAC 0.52 126 

[SEQ ID NO: 19] 



AACCGCATCATCAAGAACGAG 0.48 144 

[SEQ ID NO:20] 



AAGAACGAGCTGAAGCTGCTG 0.52 156 

[SEQ ID NO:21] 



AAGGAGCTGGCTATCATCCAC 0.52 381 

[SEQ ID NO:22] 



AAGCCTGAGAACATCATGCTG 0.48 41 1 

[SEQ ID NO:23] 



AACATCATGCTGGTGGACCAG 0.52 420 

[SEQ ID NO:24] 



AAGGTGATTGACTTCGGATCC 0.48 462 

[SEQ ID NO:25] 



AAGGAGCCATACATCCAGTCG 0.52 513 

[SEQ ID NO:26] 



GAUCCUCAAGAAUGACGCCUU 
[SEQ ID NO:35] 

UUCUAGGAGUUCUUACUGCGG 
[SEQ ID NO:52] 

GAAUGACGCCUACCGCAACUU 
[SEQ ID NO:36] 

UUCUUACUGCGGAUGGCGUUG 
[SEQ ID NO:53] 

CCGCAUCAUCAAGAACGAGUU 
[SEQ ID NO:37] 

UUGGCGUAGUAGUUCUUGCUC 
[SEQ ID NO:54] 

GAACGAGCUGAAGCUGCUGUU 
[SEQ ID NO:38] 

UUCUUGCUCGACUUCGACGAC 
[SEQ ID NO:55] 

GGAGCUGGCUAUCAUCCACUU 
[SEQ ID NO:39] 

UUCCUCGACCGAUAGUAGGUG 
[SEQ ID NO:56] 

GCCUGAGAACAUCAUGCUGUU 
[SEQ ID NO:40] 

UUCGGACUCUUGUAGUACGAC 
[SEQ ID NO:57] 

CAUCAUGCUGGUGGACCAGUU 
[SEQ ID NO:41] 

UUGUAGUACGACCACCUGGUC 
[SEQ ID NO:58] 

GGUGAUUGACUUCGGAUCCUU 
[SEQ ID NO:42] 

UUCCACUAACUGAAGCCUAGG 
[SEQ ID NO:59] 

GGAGCCAUACAUCCAGUCGUU 
[SEQ ID NO:43] 

UUCCUCGGUAUGUAGGUCAGC 
[SEQ ID NO:60] 



AACAACGAGTACGACCAGGTG 
[SEQ ID NO:27] 



AAGTCGTTGGACCAGATTGAG 
[SEQ ID NO:28] 



AAGAGCATGGTGGAGCTGATC 
[SEQ ID NO:29] 



AATGCGGTCTCCGACATGATG 
[SEQ ID NO:30] 



AAGTCCGACTCCAACTTCAGC 
[SEQ ID NO:31] 



AACTTCAGCAACCTCATTCGG 
[SEQ ID NO:32] 



AACATGACCATGGAAGCTGAG 
[SEQ ID NO:33] 



AATGGCTGAGTGAGCCAGACT 
[SEQ ID NO:34] 



0.52 651 CAACGAGUACGACCAGGUGUU 
[SEQ ID NO:44] 

UUGUUGCUCAUGCUGGUCCAC 
[SEQ ID NO:61] 

0.48 855 GUCGUUGGACCAGAUUGAGUU 
[SEQ ID NO:45] 

UUCAGCAACCUGGUCUAACUC 
[SEQ ID NO:62] 

0.52 951 GAGCAUGGUGGAGCUGAUCUU 
[SEQ ID NO:46] 

UUCUCGUACCACCUCGACUAG 
[SEQ ID NO:63] 

0.52 1341 UGCGGUCUCCGACAUGAUGUU 
[SEQ ID NO:47] 

UUACGCCAGAGGCUGUACUAC 
[SEQ ID NO:64] 

0.52 1485 GUCCGACUCCAACUUCAGCUU 
[SEQ ID NO:48] 

UUCAGGCUGAGGUUGAAGUCG 
[SEQ ID NO:65] 

0.48 1497 CUUCAGCAACCUCAUUCGGUU 
[SEQ ID NO:49] 

UUGAAGUCGUUGGAGUAAGCC 
[SEQ ID NO:66] 

0.48 1647 CAUGACCAUGGAAGCUGAGUU 
[SEQ ID NO:50] 

UUGUACUGGUACCUUCGACUC 
[SEQ ID NO:67] 

0.52 1711 UGGCUGAGUGAGCCAGACUUU 
[SEQ ID NO:51] 

UUACCGACUCACUCGGUCUGA 
[SEQ ID NO:68] 



Target segment starts with CA 

Target segment: 5* -> 3' GC 



CAAGATCCTCAAGAATGACGC 0.48 
[SEQ ID NO:69] 



CAAGAATGACGCCTACCGCAA 0.52 
[SEQ ID NO:70] 



Position siRNA Sense strand: 5' -> 3 f 

siRNA Antisense strand: 3' -> 5* 

1 1 6 AG AU CCU C AAG AAU G ACG C U U 
[SEQ ID NO:89] 

UUUCUAGGAGUUCUUACUGCG 
[SEQIDNO:109] 

1 25 AGAAUGACGCCUACCGCAAUU 
[SEQ ID NO:90] 

UUUCUUACUGCGGAUGGCGUU 
[SEQ ID NO:110] 



CAACCGCATCATCAAGAACGA 0.48 143 

[SEQ ID NO:71] 



CATCAAGAACGAGCTGAAGCT 0.48 152 

[SEQ ID NO:72] 



CAAGAACGAGCTGAAGCTGCT 0.52 1 55 

[SEQ ID NO:73] 



CATCCGCTTCCTTGAGTTCTT 0.48 215 

[SEQ ID NO:74] 



CAGAAGGAGAACAACTTCGCG 0.52 297 

[SEQ ID NO:75] 



CAAGGAGCTGGCTATCATCCA 0.52 380 

[SEQ ID NO:76] 



C AAG C CTG AG AAC ATC ATG CT 0.48 410 

[SEQ ID NO:77] 



CAAGGTGATTGACTTCGGATC 0.48 461 

[SEQ ID NO:78] 



CATACATCCAGTCGCGCTTCT 0.52 520 

[SEQ ID NO:79] 



CAACAACGAGTACGACCAGGT 0.52 650 

[SEQ ID NO:80] 



CACCACTTCTTCAAGCGCAAC 0.52 735 

[SEQIDNO:81] 



ACCGCAUCAUCAAGAACGAUU 
[SEQ ID NO:91] 

UUUGGCGUAGUAGUUCUUGCU 
[SEQ ID NO:111] 

UCAAGAACGAGCUGAAGCUUU 
[SEQ ID NO:92] 

UUAGUUCUUGCUCGACUUCGA 
[SEQ ID NO: 112] 

AGAACGAGCUGAAGCUGCUUU 
[SEQ ID NO:93] 

UUUCUUGCUCGACUUCGACGA 
[SEQ ID NO:113] 

UCCGCUUCCUUGAGUUCUUUU 
[SEQ ID NO:94] 

U U AG G C G AAG G AAC U C AAG A A 
[SEQ ID NO: 11 4] 

GAAGGAGAACAACUUCGCGUU 
[SEQ ID NO:95] 

UUCUUCCUCUUGUUGAAGCGC 
[SEQ ID NO:115] 

AGGAGCUGGCUAUCAUCCAUU 
[SEQ ID NO:96] 

UUUCCUCGACCGAUAGUAGGU 
[SEQ ID NO:116] 

AGCCUGAGAACAUCAUGCUUU 
[SEQ ID NO:97] 

UUUCGGACUCUUGUAGUACGA 
[SEQ ID NO: 117] 

AGGUGAUUGACUUCGGAUCUU 
[SEQ ID NO:98] 

UUUCCACUAACUGAAGCCUAG 
[SEQIDNO:118] 

UACAUCCAGUCGCGCUUCUUU 
[SEQ ID NO:99] 

UUAUGUAGGUCAGCGCGAAGA 
[SEQ ID NO: 119] 

ACAACGAGUACGACCAGGUUU 
[SEQ ID NO:100] 

UUUGUUGCUCAUGCUGGUCCA 
[SEQ ID NO:120] 

CCACUUCUUCAAGCGCAACUU 
[SEQ ID NO:101] 

UUGGUGAAGAAGUUCGCGUUG 
[SEQ ID NO:121] 



CAAGTCGTTGGACCAGATTGA 0.48 854 

[SEQ ID NO:82] 



CAAGAGCATGGTGGAGCTGAT 0.52 950 

[SEQ ID NO:83] 



CAATGCGGTCTCCG ACATGAT 0. 52 1 340 

[SEQ ID NO:84] 



CAAGTCCGACTCCAACTTCAG 0.52 1484 

[SEQ ID NO:85] 



C AACTTC AG C AAC CTC ATTC G 0.48 1496 

[SEQ ID NO:86] 



C AAC ATG ACC ATG G AAG CTG A 0.48 1646 

[SEQ ID NO:87] 



CATGACCATGGAAGCTGAGAG 0.52 1649 

[SEQ ID NO:88] 



AGUCGUUGGACCAGAUUGAUU 
[SEQ ID NO: 102] 
UUUCAGCAACCUGGUCUAACU 
[SEQ ID NO: 122] 

AGAGCAUGGUGGAGCUGAUUU 
[SEQ ID NO: 103] 
UUUCUCGUACCACCUCGACUA 
[SEQ ID NO: 123] 

AUGCGGUCUCCGACAUGAUUU 
[SEQ ID NO: 104] 
UUUACGCCAGAGGCUGUACUA 
[SEQ ID NO: 124] 

AGUCCGACUCCAACUUCAGUU 
[SEQ ID NO: 105] 

UUUCAGGCUGAGGUUGAAGUC 
[SEQ ID NO: 125] 

ACUUCAGCAACCUCAUUCGUU 
[SEQ ID NO: 106] 

UUUGAAGUCGUUGGAGUAAGC 
[SEQ ID NO: 126] 

ACAUGACCAUGGAAGCUGAUU 
[SEQ ID NO: 107] 
UUUGUACUGGUACCUUCGACU 
[SEQ ID NO: 127] 

UGACCAUGGAAGCUGAGAGUU 
[SEQ ID NO: 108] 
UUACUGGUACCUUCGACUCUC 
[SEQ ID NO: 128] 



Target segment starts with GA 

Target segment: 5' -> 3* 



GC Ratio Position 



siRNA Sense strand: 5* -> 3' 
siRNA Antisense strand: 3' -> 5' 



GAGATGGTGGCCATCAAGATC 0.52 102 

[SEQ ID NO: 129] 



GATGGTGGCCATCAAGATCCT 0.52 104 

[SEQ ID NO:130] 



G ATC CTC AAG AATG AC G C CT A 0.48 1 1 9 

[SEQ ID NO:131] 



GAUGGUGGCCAUCAAGAUCUU 
[SEQ ID NO: 144] 
UUCUACCACCGGUAGUUCUAG 
[SEQ ID NO: 159] 

UGGUGGCCAUCAAGAUCCUUU 
[SEQ IDNO:145] 
UUACCACCGGUAGUUCUAGGA 
[SEQ ID NO: 160] 

UCCUCAAGAAUGACGCCUAUU 
[SEQ ID NO:146] 
UUAGGAGUUCUUACUGCGGAU 
[SEQ IDNO:161] 



GAGTTCCAGAAGGAGAACAAC 0.48 291 

[SEQ ID NO: 132] 



GATCTCAAGCCTGAGAACATC 0.48 405 

[SEQ ID NO: 133] 



GAGAACATCATGCTGGTGGAC 0.52 417 

[SEQ ID NO: 134] 



GAACATCATGCTGGTGGACCA 0.52 419 

[SEQ ID NO: 135] 



GAAGGAGCCATACATCCAGTC 0.52 512 

[SEQ ID NO: 136] 



GATTGAGACAGTGAATGGTGG 0.48 869 

[SEQ ID NO: 137] 



GAGACAGTGAATGGTGGCAGT 0.52 873 

[SEQ ID NO:138] 



G AC AGTG AATG GTG GC AGTGT 0.52 875 

[SEQ ID NO: 139] 



GAGCATGGTGGAGCTGATCAA 0.52 953 

[SEQ ID NO:140] 



GAGAAGGCACCAGGTATGCAA 0.52 1248 

[SEQ ID NO:141] 



GACTCCAACTTCAGCAACCTC 0.52 1491 

[SEQ ID NO:142] 



GUUCCAGAAGGAGAACAACUU 
[SEQ ID NO:147] 

UUCAAGGUCUUCCUCUUGUUG 
[SEQ ID NO: 162] 

UCUCAAGCCUGAGAACAUCUU 
[SEQ ID NO: 148] 

UUAGAGUUCGGACUCUUGUAG 
[SEQ ID NO:163] 

GAACAUCAUGCUGGUGGACUU 
[SEQ ID NO:149] 
UUCUUGUAGUACGACCACCUG 
[SEQ ID NO:164] 

ACAUCAUGCUGGUGGACCAUU 
[SEQ ID NO: 150] 
UUUGUAGUACGACCACCUGGU 
[SEQ ID NO: 165] 

AGGAGCCAUACAUCCAGUCUU 
[SEQ ID NO:151] 

UUUCCUCGGUAUGUAGGUCAG 
[SEQ ID NO:166] 

UUGAGACAGUGAAUGGUGGUU 
[SEQ ID NO:152] 
UUAACUCUGUCACUUACCACC 
[SEQ ID NO:167] 

GACAGUGAAUGGUGGCAGUUU 
[SEQ ID NO:153] 
UUCUGUCACUUACCACCGUCA 
[SEQ ID NO: 168] 

CAGUGAAUGGUGGCAGUGUUU 
[SEQ ID NO: 154] 
UUGUCACUUACCACCGUCACA 
[SEQ ID NO: 169] 

GCAUGGUGGAGCUGAUCAAUU 
[SEQ ID NO: 155] 
UUCGUACCACCUCGACUAGUU 
[SEQ ID NO: 170] 

GAAGGCACCAGGUAUGCAAUU 
[SEQ ID NO: 156] 
UUCUUCCGUGGUCCAUACGUU 
[SEQ ID NO:171] 

CUCCAACUUCAGCAACCUCUU 
[SEQ ID NO: 157] 

UUGAGGUUGAAGUCGUUGGAG 
[SEQ ID NO: 172] 



G AC AAC ATG AC C ATG G AAG CT 0.48 1644 CAACAUGACCAUGGAAGCUUU 

[SEQ ID NO: 143] [SEQ ID NO: 158] 

UUGUUGUACUGGUACCUUCGA 
[SEQ ID NO: 173] 



Target segment starts with TA 

Target segment: 5' -> 3' 



TACCGCAACCGCATCATCAAG 
[SEQ ID NO: 174] 



TACGTGAAGGAGCCATACATC 
[SEQ ID NO:175] 



TACATCCAGTCGCGCTTCTAC 
[SEQ ID NO:176] 



TATGCTCAAGTCGTTGGACCA 
[SEQ ID NO:177] 



TACTACTGTCTGGCTGAGGAG 
[SEQ ID NO: 178] 



TACTGTCTGGCTGAGGAGAAG 
[SEQ ID NO: 179] 



GC Ratio Position siRNA Sense strand: 5' -> 3' 

siRNA Antisense strand: 3' -> 5' 

0.52 138 CCGCAACCGCAUCAUCAAGUU 
[SEQ ID NO: 180] 

UUGGCGUUGGCGUAGUAGUUC 
[SEQ ID NO: 186] 

0.48 507 CGUGAAGGAGCCAUACAUCUU 
[SEQ ID NO:181] 

UUGCACUUCCUCGGUAUGUAG 
[SEQ ID NO:187] 

0.52 522 CAUCCAGUCGCGCUUCUACUU 
[SEQ ID NO:182] 

U U G U AG G U C AGCG C G AAG AU G 
[SEQ ID NO: 188] 

0.48 848 UGCUCAAGUCGUUGGACCAUU 
[SEQ ID NO: 183] 
UUACGAGUUCAGCAACCUGGU 
[SEQ ID NO: 189] 

0.52 1173 CUACUGUCUGGCUGAGGAGUU 
[SEQ ID NO: 184] 
UUGAUGACAGACCGACUCCUC 
[SEQ ID NO: 190] 

0.52 1176 CUGUCUGGCUGAGGAGAAGUU 
[SEQ ID NO: 185] 
UUGACAGACCGACUCCUCUUC 
[SEQIDNO:191] 
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FIGURE 2 



A 

Protein Sequence of Human HIPK4 

Total length -- 616 AA 

MSTIQSETDCYDIIEVLGKGTFGEVAKGWRRSTGEMVAIKILKNDAYRNRIIKNELKLLHCMRGLDPEEAHVIRFLEFFH 
DAliKFYLVFELLEQNLFEFQKENNFAPLPARHIRTVTLQVL»TAIiARLKELiAI IHADLKPENIMLVDQTRCPFRVKVIDFG 
SASIFSEWYVKEPYIQSRFYRAPEILLGLPFCEKVDVWSLGCVMAELHLGWPLYPGNNEYDQVRYICETQGLPKPHLLH 
AACKAHHFFKRNPHPDAANPWQLKSSADYLAETKVRPLERRKYMLKSLDQIETW^ 

VELIKRMLTWESHERISPSAALRHPFVSMQQLRSAHETTHYYQLSLRSYRLSLQVEGKPPTPWAAEDGTPYYCLAEEKE 
AAGMGSVAGSSPFFREEKAPGMQRAIDQLDDLSLQEAGHGLWGETCTNAVSDM^ 

RLAGRHKARKPPAGSKSDSNFSNLIRLSQVSPEDDRPCRGSSWEEGEHLGASAEPLAILQRDEDGPNIDNMTMEAERPDP 
ELFDPSSCPGEWLSEPDCTLESVRGPRAQGLPPRRSHQHGPPRGATSFLQHVTGHH 

B 

ATP Binding Domain -- 17-40 (predicted) 

•LGKGTFGEVAKGWRRSTGEMVAI K 

c 

Serine/Threonine Binding Domain -- 132-144 (predicted) 

I IHADLKPENIML 

D 

Protein Kinase Domain 11-347 (predicted) 

YDI I EVLGKGTFGEVAKGWRRSTGEMVAI KI LKNDAYRNRI I KNELKLLHCMRGLDPEEAHVI RFLEFFHDALKF YLVFE 
LLEQNLFEFQKENNFAPLPARHIRTVTLQVLTALARLKELAI IHADLKPENIML VDQTRCPFRVKVIDFGSAS I FSEVRY 
VKEPYIQSRFYRAPEILLGLPFCEKVDVWSLGCVMAELHLGW 

RNPHPDAANPWQLKSSADYLAETK^RPLERRKYMLKSLDQIETWGGSVASRLTFPDREALAEHADLKSWELIKRMLTW 
ESHERISPSAALRHPFV 
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FIGURE 3 



A 

Protein Sequence of Mouse HJPK4 

Total length 616 AA 

MATIQSETDCYDI IEVLGKGTFGEVAKGWRRSTGE^WAIKILKNDAYRSRI IKNELKLLRCVRGLDPDEAHVIRFLEFFH 
DALKFYLVFELLEQNLFEFQKENNFAPLPARHIRTVTLQVLRALARLKELAI IHADLKPENIMLVDQTRCPFRVKVIDFG 
SASIFSEVRYVKEPYIQSRFYRAPEILLGLPFCEKVDWSLGCVMAELHLGWPLYPGNNEYDQVRYICETQGLPKPHLLH 
AARKAHHFFKRNPHPDATNPWQLKSSADYLAETKVRPLERRKYMLKSLDQIETVNGGGAVSRLSFPDREALAEHADLKSM 
VELIKRMLTWESHERISPSAALRHPFVSMQQLRSAHEATRYYQLSLRGCRLSLQVDGKPPPPVIASAEDGPPYYRLAEEE 
ETAGLGGVTGSGS FFREDKAPGMQRAI DQLDDLSLQEARRGLWSDTRADMVSDMLVPLKVASTSHRVPDSGPE PI LAF YG 
SRLTGRHKARKAPAGSKSDSNFSNLIRLSQASPEDAGPCRGSGWEEGEGRTTSTEPSVIPQREGDGPGIKDRPMDAERPG 
PELFDPSSCPGEWLSEPEWTLEGIRGSRAQGLPAHHPHPHGPPRTTSFLQHVGGHH 

B 

ATP Binding Domain 17-40 (predicted) 

LGKGTFGEVAKGWRRSTGEMVAI K 

c 

Serine/Threonine Binding Domain 13 2-144 (predicted) 

I IHADLKPENIML 

D 

Protein Kinase Domain -- 11-347 (predicted) 

YDIIEVLGKGTFGEVAKGWRRSTGEMVAIKILKNDAYRSRIIKNELKLLRCVRGLDPDEAHVIRFLEFFHDALKFYLVFE 
LLEQNLFEFQKENNFAPLPARHIRTVTLQVLRALARLKE LAI IHADLKPENIML VDQTRCPFRVKVIDFGSASIFSEVRY 
VKEPYIQSRFYRAPEILLGLPFCEKVDVWSLGCVMAELHLGWPLYPGNNEYDQVRYICETQGLPKPHLLHAARKAHHFFK 
RNPHPDATNPWQLKSSADYLAETKVRPLERRKYMLKSLDQIETWGGGAVSRLSFPDREALAEHADLKSMVELIKRMLTW 
ESHERISPSAALRHPFV 
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FIGURE 4 

A 

Protein Sequence of Monkey HIPK4 

Total length -- 616 AA 

MATTQS ETDC YD 1 1 E VLGKGTFGE VAKGWRRSTGEMVAI KI LKNDAYRNR 1 1 KNELKLLHCMRGLD PEEAHVI RFLE F FH 
DALKFYLVFELLEQNLFEFQKENNFAPLPARHIRTVTLQVLRAIiARLKELAI IHADLKPENIMLVDQTRCPFRVKVIDFG 
SASIFSEWYVKEPYIQSRFYRAPEILLGLPFCEKVDWSLGCV^1AELHLGWPLYPG^WEYDQVRYICETQGLPKPHLLH 
AARKAHHFFKRNPHPDAANPWQLKSSADYLAETKVRPLERRKYMLKSL^ 

VELIKRMLTWESHERISPSAALRHPFVSMQQLRNAHETTHYYQLSLRSYRLSLQVEGKPPAPWAAEDGTPYYRLAEEKE 
AAGMGSVASSSPFFREEKAPGMQRAIDQLDDLSLQEAGHGLWGETCTDVVSDMMAPLKAAITGRHMPDSGPEPILAFYSS 
RLAGRHKARKPPAGSKSDSNLSNLIRLSQVSPEDDRPCRGSSWEEGEHLGASAEPPAILQRDGDGPNIDNMTMEAERPDP 
ELFDPSSCPGEWLSEPDWTLEGVRGPRAQGLPPRRSHQHGPPRGATSFLQHVTGHH 

B 

ATP Binding Domain 17-40 (predicted) 

LGKGTFGEVAKGWRRSTGEMVAIK 

c 

Serine/Threonine Binding Domain -- 13 2-144 (predicted) 

IIHADLKPENIML 

D 

Protein Kinase Domain -- 11-347 (predicted) 

YDI I EVLGKGTFGEVAKGWRRSTGEMVAI KILKNDAYRNRI I KNELKLLHCMRGLDPEEAHVI RFLEFFHDALKFYLVFE 
LLEQNLFEFQKENNFAPLPARHIRTVTLQVLRALARLKEI^IIHADLKPENIMLVDQTRCPFRVKVIDFGSASIFSEVRY 
VKEPYIQSRFYRAPEILLGLPFCEKOTVWSLGCVMAELHLGWPLYPGNNEYDQVRYICETQGLPKPHLLHAARKAHHFFK 
RNPH PDAANPWQLKS S ADYLAETKVRPLERRKYMLKS LDQ I ETVNGGS VASRLTF PDREALAEHADLKSMVEL I KRMLTW 
ESHERISPSAALRHPFV 
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FIGURE 5 



A 

Protein Sequence of Rat HIPK4 

Total length -- 616 AA 

MATIQSETDCYDI I EVLGKGTFGEVAKGWRRSTGEMVAI KI LKNDAYRSRI I KNELKLLRCVRGLDPDEAHVI RFLEFFH 
DALKFYLVFELLEQNLFEFQKENNFAPLPARHIRTVTLQVLRALARLKELAIIHADLKPENIMLTO^ 

SASIFSEWYVKEPYIQSRFYRAPEILLGLPFCEKVDWSLGCVl^ELHLGWPLYPG^EYDQVRYICETQGLPKPHLL^ 

AARKAHHFFKRNPHPDATNPWQLKSSADYLAETKWPLERRKYMLKSLDQIETWGGGAVSRLSFPDRE 

VELIKRMLTWESHERISPSAALRHPFVSMQQLRSAHEATRYYQLSLRGCRLSLQVDGKPPPPVIASAEDGPPYYRLAEEE 

ETAGLGGVTGSGSFFREDKAPGMQRAIDQLDDLSLQEARRGLWSDTRADlWSDMLVPLKVASTSHRVPDSGPEPIIiAFYG 

SRLTGRHKARKAPAGSKSDSNFSNLIRLSQASPEDAGPCRGSGWEEGEGRTTSTEPSVIPQREGDGPGIKDRPMDAERPG 

PELFDPSSCPGEWLSEPEWTLEGIRGSRAQGLPAHHPHPHGPPRTTSFLQHVGGHH 

B 

ATP Binding Domain -- 17-40 (predicted) 

LGKGTFGEVAKGWRRSTGEMVAIK 

c 

Serine/Threonine Binding Domain -- 132-144 (predicted) 

I IHADLKPENIML 

D 

Protein Kinase Domain -- 11-347 (predicted) 

YDI I EVLGKGTFGEVAKGWRRSTGEMVAI KI LKNDAYRSRI I KNELKLLRCVRGLDPDEAHVI RFLEFFHDALKFYLVFE 
LLEQNLFEFQKENNFAPLPARHIRTVTLQVLRALARLKELAI IHADLKPENIML VDQTRCPFRVKVIDFGSASI FSEVRY 
VKEPYIQSRFYRAPEILLGLPFCEKVDVWSLGCVMAELHLGWPLYPGNNEYDQVRYICETQGLPKPHLLHAARKAHHFFK 
RNPHPDATNPWQLKSSADYLAETKVRPLERRKYMLKSLDQIETVNGGGAVSRLSFPDREALAEHADLKSMVELIKRMLTW 
ESHERISPSAALRHPFV 
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FIGURE 6 



CLUSTAL W multiple sequence alignment results 



MSF: 346 Type: P 



Check: 4 900 



Name: mushipk3_np_034 564 . kd . pep oo Len 

Name: rathipk3_np_113 97 5 . kd . f as . pep oo 

Name: humhipk3_np_00 572 5 . kd.pep oo Len 

Name: humhipkl- like_ax 3 182 66 .kd.pep oo 

Name: mushipkl_np_034 562 . kd . pep oo Len 

Name: humhipk2_np_07 357 7 . kd . f as -pep oo 

Name: mushipk2_np_0 34 56 3 . kd . pep oo Len 

Name: wormf 2 0b6_ql 9632 . kd.pep oo Len: 

Name: humhipk4 . kd . f as .pep oo Len: 34 6 

Name: monkeyhipk4_ab074 44 9 . kd . f as . pe oo 

Name: mushipk4 . kd . f as . pep oo Len: 34 6 

Name: rathipk4_xp_2183 55 . kd . f as . pep oo 

Name: yeastyakl_np_0123 94 . kd . f as . pep oo 

Name: dmcgl7 0 90_q95tg3 . kd .pep oo Len: 



34 6 Check: 918 9 Weight: 7.5 
Len: 346 Check: 347 Weight: 6.5 

346 Check: 9094 Weight: 7.8 
Len: 34 6 Check: 7 73 Weight: 6.8 

346 Check: 878 Weight: 7.0 
Len: 346 Check: 11 Weight: 6.7 
34 6 Check: 104 7 Weight: 7.7 
346 Check: 9639 Weight: 16.4 
Check: 4355 Weight: 5.4 

Len: 346 Check: 4346 Weight: 5. 
Check: 5897 Weight: 5.4 
Len: 346 Check: 5822 Weight: 4.8 
Len: 346 Check: 3415 Weight: 12 
346 Check: 87 Weight: 26.0 



// 

mushipk3_np_ 
rathipk3_np_ 
humh i pk 3 _np_ 
humhipkl - lik 
mushipkl_np_ 
humh i p k 2 _np_ 
mushipk2_np_ 
wormf 2 0b6_ql 
humhipk4 . kd . 
monkeyhipk4_ 
mushipk4 . kd . 
rathipk4_xp_ 
yeas t yak l_np 
dmcgl7 0 90_q9 



YEVLDFLGRG 
YEVLDFLGRG 
YEVLDFLGRG 
YEVLEFLGRG 
YEVLEFLGRG 
YEVLEFLGRG 
YEVLEFLGRG 
YEVLEFLGKG 
YDIIEVLGKG 
YDIIEVLGKG 
YDIIEVLGKG 
YDIIEVLGKG 
YLVLDILGQG 
. . . YIFNCLD 



TFGQWKCWK 
TFGQWKCWK 
TFGQWKCWK 
TFGQVAKCWK 
TFGQVAKCWK 
TFGQWKCWK 
TFGQWKCWK 
TFGQWKAWK 
TFGEVAKGWR 
TFGEVAKGWR 
TFGEVAKGWR 
TFGEVAKGWR 
TFGQWKCQN 
DIGQVNVPTD 



RGTNEIVAIK 
RGTNEIVAIK 
RGTNEIVAIK 
RSTKEIVAIK 
RSTKEIVAIK 
RGTNEIVAIK 
RGTNEIVAIK 
KGTSEIVAIK 
RSTGEMVAIK 
RSTGEMVAI K 
RSTGEMVAIK 
RSTGEMVAIK 
LLTKEILAVK 
LEGGQLLAEK 



ILKNHPSYAR 
ILKNHPSYAR 
ILKNHPSYAR 
ILKNHPSYAR 
ILKNHPSYAR 
ILKNHPSYAR 
ILKNHPSYAR 
ILKKHPSYAR 
I LKNDAYRNR 
I LKNDAYRNR 
ILKNDAYRSR 
ILKNDAYRSR 
WKSRTEYLT 
TDRREFIDLL 
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QGQ I EVS I LA 
QGQIEVSILA 
QGQ I EVS I LA 
QGQ I EVS I LS 
QGQ I EVS I LS 
QGQIEVSILA 
QGQIEVSILA 
QGQ I EVS I LS 
IIKNELKLLH 
I IKNELKXjLH 
IIKNELKLLR 
IIKNELKLLR 
QSITEAKILE 
KRMLTIDQER 



mushipk3_np_ 
rathipk3_np_ 
humh i p k 3 _np_ 
humhipkl - lik 
mushipkl_np_ 
humh i pk 2 _np_ 
mushipk2_np_ 
wormf 2 0b6_ql 
humhipk4 . kd . 
monkeyhipk4_ 
mushipk4 . kd . 
rathipk4_xp_ 
yeastyakl_np 
dmcgl7090_q9 
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RLSTENA . 
RLSTENA . 
RLSTENA . 
RLSSENA. 
RLSSENA . 
RLSTESA. 
RLSTESA . 
RLSNENS . 
CMRGLDP . 
CMRGLDP . 
CVRGLDP . 
CVRGLDP . 
LLNQKIDPTN 
RLTPAEALNH 



EYNFVRAYEC 
EYNFVRAYEC 
EYNFVRAYEC 
EYNFVRSYEC 
EYNFVRSYEC 
DYNFVRAYEC 
DYNFVRAYEC 
EFNFVRAFEC 
EAHVIRFLEF 
EAHVIRFLEF 
EAHVIRFLEF 
EAHVIRFLEF 
KHHFLRMYDS 
SFTRLTHLVD 



FQHRNHTCLV 
FQHRNHTCLV 
FQHRNHTCLV 
FQHKNHTCLV 
FQHKNHTCLV 
FQHKNHTCLV 
FQHKNHTCLV 
FNHKSHTCLV 
FHDALKFYLV 
FHDALKFYLV 
FHDALKFYLV 
FHDALKFYLV 
FVHKNHLCLV 
YVYCNNVKAS 



FEMLEQNLYD 
FEMLEQNLYD 
FEMLEQNLYD 
FEMLEQNLYD 
FEMLEQNLYD 
FEMLEQNLYD 
FEMLEQNLYD 
FEMLEQNLYD 
FELLEQNLFE 
FELLEQNLFE 
FELLEQNLFE 
FELLEQNLFE 
FELLSNNLYE 
VQMMEVCRRG 
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FLKQNKFSPL 
FLKQNKFSPL 
FLKQNKFSPL 
FLKQNKFSPL 
FLKQNKFSPL 
FLKQNKFSPL 
FLKQNKFSPL 
FLKQNKFMPL 
FQKENNFAPL 
FQKENNFAPL 
FQKENNFAPL 
FQKENNFAPL 
LLKQNKFHGL 
DFHTVQPAST 
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mushipk3_np_ PLKVIRPVLQ 
rathipk3_np_ PLKVIRPVLQ 
humhipk3_np_ PLKVIRPILQ 
humhipkl-lik PLKYIRPILQ 
mushipkl_np_ PLKYIRPILQ 
humhipk2_np_ PLKYIRPVLQ 
mushipk2_np_ PLKYIRPVLQ 
wormf2 0b6_ql PLNAIRPILF 
humhipk4.kd. PARH I RTVTL 
monkeyhipk4_ PARH I RTVTL 
mushipk4 . kd . PARH I RTVTL 
rathipk4_xp_ PARH I RTVTL 
yeastyakl_np SIQLIRTFTT 
dmcgl7090_q9 LVTNFVPSST 
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mushipk3_np_ FGSASHVSKT 
rathipk3_np_ FGSASHVSKT 
humhipk3_np_ FGSASHVSKT 
humhipkl-lik FGSASHVSKA 
mushipkl__np_ FGSASHVSKA 
humhipk2_np_ FGSASHVSKA 
mushipk2_np_ FGSASHVSKA 
wormf2 0b6_ql FGSASHRSKA 
humhipk4 . kd. FGSASIFSEV 
monkeyhipk4_ FGSASIFSEV 
mushipk4 .kd. FGSASIFSEV 
rathipk4_xp_ FGSASIFSEV 
yeastyakl_np FGSSCEEART 
dmcgl7090_q9 RQYP 
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mushipk3_np_ FLGWPLYPGA 

rathipk3_np_ FLGWPLYPGA 

humhipk3_np_ FLGWPLYPGA 

humhipkl-lik FLGWPLYPGA 

mushipkl_np_ FLGWPLYPGA 

humhipk2_np_ FLGWPLYPGA 

mushipk2_np_ FLGWPLYPGA 

wormf2 0b6_ql FLGWPLYPGS 

humhipk4 . kd . HLGWPLYPGN 

monkeyhipk4_ HLGWPLYPGN 

mushipk4 . kd . HLGWPLYPGN 

rathipk4_xp_ HLGWPLYPGN 

yeastyakl_np FLGIPIFPGA 

dmcgl70 90_q9 TMQPPLQVPP 



QVATALKKLK SLGLIHADLK 

QVATALKKLK SLGLIHADLK 

QVATALKKLK SLGLIHADLK 

QVATALM KLK SLGLIHADLK 

QVATALMKLK SLGLIHADLK 

QVATALMKLK SLGLIHADLK 

QVATALMKLK SLGLIHADLK 

QVLTALLKLK SLGLIHADLK 

QVLTALARLK ELAI IHADLK 

QVLRALARLK ELAI IHADLK 

QVLRALARLK ELAI IHADLK 

QVLRALARLK ELAI IHADLK 

QILDSLCVLK ESKLIHCDLK 

ENMTFTINNQ LTSQVQRLVR 



..VCSTYLQS RYYRAPEIIL 
. .VCSTYLQS RYYRAPEIIL 
..VCSTYLQS RYYRAPEIIL 
. .VCSTYLQS RYYRAPEIIL 
..VCSTYLQS RYYRAPEIIL 
..VCSTYLQS RYYRAPEIIL 
. .VCSTYLQS RYYRAPEIIL 
. . VTNTYLQS RYYRAPEIIL 
RYVKEPYIQS RFYRAPEILL 
RYVKEPYIQS RFYRAPEILL 
RYVKEPYIQS RFYRAPEILL 
RYVKEPYIQS RFYRAPEILL 
. . . VYTYIQS RFYRAPEIIL 
. . . . QTRTDS FQHQLVSNIL 



LEHDQIRYIS QTQGLPGEQL 

LEYDQIRYIS QTQGLPGEQL 

LEYDQIRYIS QTQGLPGEQL 

SEYDQIRYIS QTQGLPAEYL 

SEYDQIRYIS QTQGLPAEYL 

SEYDQIRYIS QTQGLPAEYL 

SEYDQIRYIS QTQGLPAEYL 

SEYDQIRFI I QTQGLPPTSM 

NEYDQVRYIC ETQGLPKPHL 

NEYDQVRYIC ETQGLPKPHL 

NEYDQVRYIC ETQGLPKPHL 

NEYDQVRYIC ETQGLPKPHL 

SEYNQLTRI I DTLGYPPSWM 

QQYVNVPVPV SMVEPTSGQR 
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PENIMLVDPV RQPYRVKVID 

PENIMLVDPV RQPYRVKVID 

PENIMLVDPV RQPYRVKVID 

PENIMLVDPV RQPYRVKVFD 

PENIMLVDPV RQPYRVKVID 

PENIMLVDPS RQPYRVKVID 

PENIMLVDPS RQPYRVKVID 

PENIMLVDPQ QQPYRVKVID 

PENIMLVDQT RCPFRVKVID 

PENIMLVDQT RCPFRVKVID 

PENIMLVDQT RCPFRVKVID 

PENIMLVDQT RCPFRVKVID 

PENILLCAPD KP. . ELKI ID 

DGRPLAYEGL YQI YNGRSVA 
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GLPFCEAIDM WSLGCVIAEL 

GLPFCEAIDM WSLGCVIAEL 

GLPFCEAIDM WSLGCVIAEL 

GLPFCEAIDM WSLGCVIAEL 

GLPFCEAIDM WSLGCVIAEL 

GLPFCEAIDM WSLGCVIAEL 

GLPFCEAIDM WSLGCVIAEL 

GLPFNESIDM WSLGCVIAEL 

GLPFCEKVDV WSLGCVMAEL 

GLPFCEKVDV WSLGCVMAEL 

GLPFCEKVDV WSLGCVMAEL 

GLPFCEKVDV WSLGCVMAEL 

GIPYSTSIDM WSLGCIVAEL 

CPPSYQTMPS PTKHVWGSA 
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LNVGTKSTRF FCRE 

LNVGTKSTRF FCRE 

LNVGTKSTRF FCKE 

LSAGTKTTRF FNRD 

LSAGTKTTRF FNRD 

LSAGTKTTRF FNRD 

LSAGTKTTRF FNRD 

LESASKLHRF FKEVKSESPN 

LHAACKAHHF FKRNP 

LHAARKAHHF FKRNP 

LHAARKAHHF FKRNP 

LHAARKAHHF FKRNP 

IDMGKNSGKF MKKLAPEE . . 

MLLTNRVQAS GVAWPQTG . . 



251 

mushipk3_np_ . TDMSHSGWR LKTLEEHEAE 

rathipk3_np_ . TDMSHSGWR LKTLEEHEAE 

humhipk3_np_ .TDMSHSGWR LKTLEEHEAE 

humhipkl-lik .PNLGYPLWR LKTPEEHELE 

mushipkl_np_ . PNLGYPLWR LKTPEEHELE 

humhipk2_np_ . TDSPYPLWR LKTPDDHEAE 

mushipk2_np_ . TDSPYPLWR LKTPDDHEAE 

wormf2 0b6_ql HTNVGGSYYR LKTVEEYEAS 

humhipk4 . kd . . HPDAANPWQ LKSSADYLAE 

monkeyhipk4_ . HPDAANPWQ LKSSADYLAE 

mushipk4 . kd . . HPDATNPWQ LKSSADYLAE 

rathipk4_xp_ . HPDATNPWQ LKSSADYLAE 

yeastyakl_np . SSSSTQKHR MKTIEEFCRE 

dmcgl7090_q9 RQMALVPSWP QQ APAHSL IV 
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mushipk3_np_ VMDLEGGDLL AEKADRREFV 

rathipk3_np_ VMDLEGSDLL AEKADRREFV 

humhipk3_np_ VMDLEGSDLL AEKADRREFV 

humhipkl-lik STDLEGTDML AEKADRRE Y I 

mushipkl_np_ STDLEGTDML AEKADRREYI 

humhipk2_np_ TTDLEGSDML VEKADRREFI 

mushipk2_np_ TTDLEGSDML VEKADRREFI 

wormf 2 0b6_ql GFESDPVEHL CDRIDRQEFV 

humhipk4.kd. RLTFPDREAL AEHADLKSMV 

monkeyhipk4_ RLTFPDREAL AEHADLKSMV 

mushipk4 . kd . RLSFPDREAL AEHADLKSMV 

rathipk4_xp_ RLSFPDREAL AEHADLKSMV 

yeastyakl_np SIQNSQELID QEMQNRECLI 

dmcgl7 090_q9 HHIAKGNSYR VPRHEKKEHQ 
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NLLKKMLLID ADLRITPIET LNHPFV 
SLLKKMLLID ADLRITPIET LNHPFV 
SLLKKMLLID ADLRITPAET LNHPFV 
DLLKKMLTID ADKRITPLKT LNHQFV 
DLLKKMLTID ADKRITPLKT LNHQFV 
DLLKKMLTID ADKRITPIET LNHPFV 
DLLKKMLTID ADKRVTPIET LNHPFV 
DVLKKMLVLN PDFRITPAEG LESKFV 
ELIKRMLTWE SHERISPSAA LRHPFV 
ELIKRMLTWE SHERISPSAA LRHPFV 
ELIKRMLTWE SHERISPSAA LRHPFV 
ELIKRMLTWE SHERISPSAA LRHPFV 
HFLGGVLNLN PLERWTPQQA MLHPFI 
QLSPVKKRVK ESSPPHQQRY QRAAHV 
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dmcg17090 q95tg3.kd.pep 

I yeostyok! rip 01 2394.kd.fos.pe 

— i humhipk4.kd.fos. pep 

'monkeyhipk4 ab074449.kd.fas.p 
- | mushipk4.kd.fos. pep 

'rothipk4 xp 2 1 8355. kd. fas. pep 

' wormf20b6 q 1 9632. kd. pep 

t humhipkl — like ax31 8266. kd. pep 
'mushipkl np 034562. kd. pep 
ihumhipk2 np 073577. kd. fas. pep 
I mushipk2 np 034563. kd. pep 
humhipk3 np 005725. kd. pep 
mushipk3 np 034564. kd. pep 
rathipk3 np 1 1 3975. kd.fos. pep 
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CLUSTAL W multiple sequence alignment results 



MSF: 340 Type: P Check: 1950 



Name : 


musdyrkla_np_031916 


.kd 


pep 


oo 


Len: 


340 


Check : 


8276 


Weight : 


2 


.2 


Name : 


r a t dyr k 1 a_np_0 36923 


.kd 


pep 


oo 


Len: 


340 


Check : 


8276 


Weight : 


2 


.2 


Name : 


humdyr k 1 a_np_0 013 87 


.kd 


pep 


oo 


Len: 


340 


Check : 


8161 


Weight : 


2 


.2 


Name : 


humdy r k 1 b_np_0 047 05 


.kd 


.pep 


oo 


Len: 


340 


Check : 


7992 


Weight : 


3 


.5 


Name : 


mus dyr k 1 b_np_0 34222 


.kd 


pep 


oo 


Len: 


340 


Check: 


6091 


Weight : 


2 


.6 


Name : 


dmmnb_aaf 48776 .kd. pep oo 


Len: 


340 


Check: 7580 


Weight: 5.3 







Name: wormt04cl0_q22 155 . kd . pep oo Len: 340 Check: 9628 Weight: 8.0 

Name: dmsmi3 5a_q9v3d5 . kd . pep oo Len: 340 Check: 6217 Weight: 8.4 

Name: humdyrk4_dyr4_human . kd . pep oo Len: 34 0 Check: 14 2 0 Weight: 8.0 

Name: humdyrk2_np_006473 . kd . pep oo Len: 34 0 Check: 8233 Weight: 6.6 

Name: humdyr k3_np_003 573 . kd . pep oo Len: 34 0 Check: 8166 Weight: 7.5 

Name: wormf 4 9ell_q20604 .kd.pep oo Len: 34 0 Check: 7542 Weight: 7.5 

Name: kab7_schpo . kd .pep oo Len: 34 0 Check: 97 98 Weight: 11.1 

Name: poml_schpo . kd . pep oo Len: 340 Check: 83 79 Weight: 9.8 

Name: humhipk4 . kd . pep oo Len: 34 0 Check: 8285 Weight: 3.5 

Name: monkeyhipk4_ab07444 9 . kd . f as .pe oo Len: 340 Check: 82 93 Weight: 3.5 

Name: mushipk4 . kd. pep oo Len: 340 Check: 982 9 Weight: 3.5 

Name: rathipk4_xp_2183 55 . kd . pep oo Len: 34 0 Check: 97 84 Weight: 3.5 
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r a t dy r k 1 a_np 
humdy rk 1 a_np 
humdy rk 1 b_np 
musdyrklb_np 
dmmnb_aa f 4 8 7 
wormt04cl0_q 
dmsmi3 5a_q9v 
humdy rk4 _dy r 
humdy rk2_np_ 
humdy rk3_np_ 
wormf 4 9ell_q 
kab7_schpo. k 
poml_schpo.k 
humhipk4 . kd . 
monkeyhipk4_ 
mushipk4 . kd . 
rathipk4_xp_ 
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YEIDSLIGKG 
YEIDSLIGKG 
YEIDSLIGKG 
YEIDSLIGKG 
YEIDSLIGKG 
YEIDSLIGKG 
ILSDTPVGKG 
YEILEVIGKG 
YEVLETIGKG 
YEVLKVI GKG 
YEVLKI IGKG 
YEVLKVI GKG 
YE I IDTVGKG 
YEWDFLGKG 
YD I I E VLGKG 
YDI IEVLGKG 
YD I I EVLGKG 
YD I IEVLGKG 



SFGQWKAYD 
SFGQWKAYD 
SFGQWKAYD 
SFGQWKAYD 
SFGQWKAYD 
SFGQWKAYD 
SFGQVTKAYD 
SFGQVIRALD 
SFGQVAKCLD 
SFGQWKAYD 
SFGQVARVYD 
SFGQVIKAFD 
SFGQVLKCID 
SFGQVLRCID 
TFGEVAKGWR 
TFGEVAKGWR 
TFGEVAKGWR 
TFGEVAKGWR 



RVEQEWVAIK 
RVEQEWVAIK 
RVEQEWVAIK 
HQTQELVAIK 
HQTQELVAIK 
HEEQCHVAIK 
TLNKEEVAIK 
HKTNTHVAIK 
HKNNELVALK 
HKVHQHVALK 
HKLRQYVALK 
HKYQQYVALK 
HKRGQWAI K 
YETGKLVALK 
RSTGEMVAIK 
RSTGEMVAIK 
RSTGEMVAIK 
RSTGEMVAIK 



I I KNKKAFLN 
I I KNKKAFLN 
I I KNKKAFLN 
I I KNKKAFLN 
I I KNKKAFLN 
IIKNKKPFLN 
I I KNKKTFFD 
I IRNKKRFLN 
I IRNKKRFHQ 
MVRNEKRFHR 
MVRNEKRFHR 
LVRNEKRFHR 
VIKNRQKFHG 
I IRNKKRFHM 
I LKNDAYRNR 
ILKNDAYRNR 
ILKNDAYRSR 
ILKNDAYRSR 
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QAQIEVRLLE 
QAQIEVRLLE 
QAQIEVRLLE 
QAQIELRLLE 
QAQIELRLLE 
QAQIEVKLLE 
QAQIEIHLLE 
QAWELNILD 
QALMELKILE 
QAAEEIRILE 
QAAEEIRILE 
QADEEIRILD 
QTLVEVGILK 
QALVETKILQ 
I IKNELKLLH 
I IKNELKLLH 
I IKNELKLLR 
I IKNELKLLR 



/ 
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musdyrkla_np LMNKHDTEMK YYIVHLKRHF 
ratdyrkla_np LMNKHDTEMK YYIVHLKRHF 
humdyrkla_np LMNKHDTEMK YYIVHLKRHF 
humdyrklb_np LMNQHDTEMK YYIVHLKRHF 
musdyrklb_np LMNQHDTEMK YYIVHLKRHF 
dmmnb_aaf487 MMNRADAENK YYIVKLKRHF 
wormt04cl0_q LTNAHDKDNK YNIVTLKGHF 
dmsmi35a_q9v ELREKDADGS HNVIHMLDYT 
humdyrk4_dyr ALRKKDKDNT YNWHMKDFF 
humdyrk2_np_ HLRKQDKDNT MNVIHMLENF 
humdyrk3_np_ HLKKQDKTGS MNVIHMLESF 
wormf49ell_q HLRRQDSDGT HNI IHMLDYF 
kab7__schpo.k RLCEADPADK NNVIRYLSHF 
poml_schpo.k KIREWDPLDE YCMVQYTDHF 
humhipk4 . kd . CMRGLDPE . E AHVIRFLEFF 
monkeyhipk4_ CMRGLDPE . E AHVIRFLEFF 
mushipk4 . kd . CVRGLDPD . E AHVIRFLEFF 
rathipk4_xp_ CVRGLDPD . E AHVIRFLEFF 
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musdyrkla_np LNLTRKFAQQ MCTALL FLAT 
ratdyrkla_np LNLTRKFAQQ MCTALLFLAT 
humdyrkla_np LNLTRKFAQQ MCTALL FLAT 
humdyrklb_np LNLTRKLAQQ LCTALLFLAT 
musdyrklb_np LNLTRKLAQQ LCTALLFLAT 
dmmnb_aaf4 87 LNLTRKFAQQ LCTALLFLST 
wormt04cl0_q LNLARKFAQQ LGKTLLFLSS 
dmsmi3 5a_q9v MSLIRRFCNS IVKCLRLLY. 
humdyrk4_dyr LSIVRRFTLS VLKCLQMLS . 
humdyrk2_np_ LPLVRKFAHS ILQCLDALH. 
humdyrk3_np_ VQLVRKFAQS ILQSLDALH . 
wormf4 9ell_q LMLVRKFAYS MLLCLDLLQ . 
kab7_schpo.k LIWKSFALQ GLQALRLLQ . 
poml_schpo.k IWIKSITRQ LIQCLTLLN. 
humhipk4.kd. ARHIRTVTLQ VLTALARLK. 
monkeyhipk4_ ARHIRTVTLQ VLRALARLK. 
mushipk4 . kd . ARHIRTVTLQ VLRALARLK . 
rathipk4_xp_ ARHIRTVTLQ VLRALARLK. 
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MFRNHLCLVF EMLSYNLYDL LRNTNFRGVS 
MFRNHLCLVF EMLSYNLYDL LRNTNFRGVS 
MFRNHLCLVF EMLSYNLYDL LRNTNFRGVS 
MFRNHLCLVF ELLSYNLYDL LRNTHFRGVS 
MFRNHLCLVF ELLSYNLYDL LRNTHFRGVS 
MWRNHLCLVF ELLSYNLYDL LRNTNFRGVS 
VHRAHLCLVF ELLSYNLYDL LKNTSFRGVS 
YFRKHLCITF ELMS LNLYEL I KKNNYNGFS 
YFRNHFCITF ELLGINLYEL MKNNNFQGFS 
TFRNHICMTF ELLSMNLYEL IKKNKFQGFS 
TFRNHVCMAF ELLSIDLYEL IKKNKFQGFS 
NFRNHKCITF ELLSINLYEL IKRNKFQGFS 
DFRGHLCIVT ELLGSNLFDV^ IRENNYKGLP 
YFRDHLCVAT ELLGKNLYEL IKSNGFKGLP 
HDALKFYLVF ELLEQNLFEF QKENNFAPLP 
HDALKFYLVF ELLEQNLFEF QKENNFAPLP 
HDALKFYLVF ELLEQNLFEF QKENNFAPLP 
HDALKFYLVF ELLEQNLFEF QKENNFAPLP 
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PELSIIHCDL KPENILLCNP KRS . . AIKIV 

PELSIIHCDL KPENILLCNP KRS.. AIKIV 

PELSIIHCDL KPENILLCNP KRS.. AIKIV 

PELSIIHCDL KPENILLCNP KRS.. AIKIV 

PELSIIHCDL KPENILLCNP KRS.. AIKIV 

PELNI IHCDL KPENILLCNP KRS.. AIKIV 

PELSIIHCDL KPENVLLVNA KRS . . QIRVI 

. KENI IHCDL KPENILLKQR GSS..SIKVI 

.VEKI IHCDL KPENIVLYQK GQA..SVKVI 

. KNRIIHCDL KPENILLKQQ GRS..GIKVI 

. KNKI IHCDL KPENILLKHH GRS . . STKVI 

. KNRLIHCDL KPENVLLKQQ GRS..GIKVI 

. GQNI IHCDL KPENLLLSHP LKA..RIKLI 

. EKHVIHCDL KPENILLCHP FKS . . QVKVI 

. ELAI IHADL KPENIMLVDQ TRCPFRVKVI 

. ELAIIHADL KPENIMLVDQ TRCPFRVKVI 

. ELAIIHADL KPENIMLVDQ TRCPFRVKVI 

.ELAIIHADL KPENIMLVDQ TRCPFRVKVI 
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musdyrkla_np 
ratdyrkla_np 
humdy r k 1 a_np 
humdy r k 1 b_np 
mu s dy r k 1 b_np 
dmmnb_aaf 4 87 
wormt 0 4 c 1 0_q 
dmsmi 3 5a_q9v 
humdy rk4_dyr 
humdyrk2_np_ 
humdy rk3_np_ 
wormf 4 9ell_q 
kab7_schpo . k 
poml_schpo . k 
humhipk4 . kd . 
monkeyhipk4_ 
mushipk4 . kd . 
rathipk4_xp_ 



musdyrkla_np 
r a t dy rk 1 a_np 
humdy rk 1 a_np 
humdy r k 1 b_np 
mu s dy r k 1 b_np 
dmmnb_aa f 4 8 7 
wormt 0 4 c 1 0_q 
dmsmi 3 5a_q9 v 
humdyrk4_dyr 
humdy r k 2 _np_ 
humdy rk 3 _np_ 
wormf 4 9ell_q 
kab7_schpo . k 
poml_schpo . k 
humhipk4 . kd . 
monkeyhipk4_ 
mushipk4 . kd. 
rathipk4_xp_ 
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DFGSSCQLGQ 
DFGSSCQLGQ 
DFGSSCQLGQ 
DFGSSCQLGQ 
DFGSSCQLGQ 
DFGSSCQLGQ 
DFGSSCQTGH 
DFGSSCYVDR 
DFGSSCYEHQ 
DFGSSCYEHQ 
DFGSSCFEYQ 
DFGSSCFDDQ 
DFGSSCFYNE 
DFGSSCFEGE 
DFGSASIFSE 
DFGSASIFSE 
DFGSASIFSE 
DFGSASIFSE 

201 

MHTGEPLFSG 
MHTGEPLFSG 
MHTGEPLFSG 
MHTGEPLFSG 
MHTGEPLFSG 
MHTGEPLFSG 
MHTGEPLFAG 
LYTGFPLFPG 
LYTGYPLFPG 
LLTGYPLLPG 
LLTGQPLFPG 
LLTGYPLLPG 
LFTGVPLFPG 
MYTGFPLFPG 
LHLGW PLY PG 
LHLGWPLYPG 
LHLGWPLYPG 
LHLGWPLYPG 



R. . . IYQYIQ 
R. . . IYQYIQ 
R. . . IYQYIQ 
R. . . IYQYIQ 
R. . . IYQYIQ 
R. . . IYHYIQ 
R. . . IYQYIQ 
K. . . IYTYIQ 
K . . . VYTYIQ 
R. . .VYTYIQ 
K . . . LYTYIQ 
R. . . IYTYIQ 
K . . . VYTYLQ 
C. . .VYTYIQ 
VRYVKEPYIQ 
VRYVKEPYIQ 
VRYVKEPYIQ 
VRYVKEPYIQ 



ANEVDQMNKI 
ANEVDQMNKI 
ANEVDQMNKI 
SNEVDQMNRI 
SNEVDQMSRI 
CNEVDQMNKI 
SSEVDQMMKI 
ENEVEQLACI 
ENEVEQLACI 
EDEGDQLACM 
EDEGDQLACM 
EDENDQLAL I 
GNETEQLGYI 
ENEQEQLACI 
NNEYDQVRYI 
NNE YDQVRY I 
NNEYDQVRYI 
NNEYDQVRYI 



SRFYRSPEVL 
SRFYRS PEVL 
SRFYRSPEVL 
SRFYRSPEVL 
SRFYRSPEVL 
SRFYRSPEVL 
SRFYRSPEVL 
SRFYRSPEVI 
SRFYRSPEVI 
SRFYRAPEVI 
SRFYRAPEI I 
SRFYRAPEVI 
SRFYRAPEI I 
SRFYRSPEVI 
SRFYRAPEIL 
SRFYRAPEIL 
SRFYRAPEIL 
SRFYRAPEIL 



VEVLGIPPAH 
VEVLGIPPAH 
VEVLGIPPAH 
VEVLGIPPAA 
VEVLGIPPAP 
VEVLGMPPKY 
VEVLGMPPKE 
MEVLGLPPKV 
MEVLGL P PAG 
IELLGMPSQK 
MELLGMPPPK 
IELLGMPPPK 
MEVLGPPPMA 
MEIFGPPDHS 
CETQGLPKPH 
CETQGLPKPH 
CETQGLPKPH 
CETQGLPKPH 



LGMPYDLAID 
LGMPYDLAID 
LGMPYDLAID 
LGTPYDLAID 
LGTPYDLAID 
LGIQYDLAID 
LGIAYDTKID 
LGLQYGTAID 
LGHPYDVAID 
LGARYGMPID 
LGSRYSTPID 
LGTKYGMPID 
LGLEYGKEID 
LGMGYGTPID 
LGLPFCEKVD 
LGLPFCEKVD 
LGLPFCEKVD 
LGLPFCEKVD 



I LDQAPKARK 
ILDQAPKARK 
I LDQAPKARK 
MLDQAPKARK 
MLEQAPKARK 
LLDQAHKTRK 
MLDIGPKTHK 
L I S VARRRRL 
FIQTASRRQT 
LLDAS KRAKN 
LLEQSKRAKY 
SLETAKRART 
LIRNSTRSKA 
LIDKCSRKKV 
LLHAACKAHH 
LLHAARKAHH 
LLHAARKAHH 
LLHAARKAHH 
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MWSLGCILVE 
MWSLGCILVE 
MWSLGCILVE 
MWSLGCILVE 
MWSLGCILVE 
MWSLGCILVE 
MWSLGCILVE 
MWSLGCILAE 
MWSLGCITAE 
MWSLGCILAE 
IWSFRCILAE 
MWSLGCILAE 
IWSFGCILAE 
VWSLGCIIAE 
VWSLGCVMAE 
VWSLGCVMAE 
VWSLGCVMAE 
VWSLGCVMAE 

250 

FFEKLPDGTW 
FFEKLPDGTW 
FFEKLPDGTW 
YFERLPGGGW 
YFERLPGGGW 
FFDKIVADGS 
YFDKTEDGIY 
FFDSRDAPRC 
FFDSKGFPKN 
FVSSKGYPRY 
FINSKGIPRY 
FITSKGYPRY 
YFDSEGKPHP 
FFDSSGKPRP 
FFKRNPHPDA 
FFKRNPHPDA 
FFKRNPHPDA 
FFKRNPHPDA 



musdyrkla_np 
ratdyrkla_np 
humdy rk 1 a_np 
humdy r k 1 b_np 
mu sdy r k 1 b_np 
dmmnb_aa f 4 8 7 
wo rm 1 0 4 c 1 0_q 
dmsmi 3 5a_q9v 
humdyrk4_dyr 
humdy rk2_np_ 
humdy rk3_np_ 
wormf 4 9ell_q 
kab7_schpo . k 
poml_schpo . k 
humhipk4 . kd . 
monkeyhipk4_ 
mushipk4 . kd - 
rathipk4_xp_ 



mus dy r k 1 a_np 
r a t dy r k 1 a_np 
humdy rk 1 a_np 
humdy rklb_np 
mu s dy r k 1 b_np 
dmmnb_aa f 4 8 7 
wormt 0 4 c 1 0_q 
dmsmi 3 5a_q9v 
humdy rk4_dyr 
humdy r k 2 _np_ 
humdy r k 3 _np_ 
wormf 4 9ell_q 
kab7_schpo . k 
poml_schpo . k 
humhipk4 . kd . 
monkeyhipk4_ 
mushipk4 . kd . 
rathipk4_xp_ 



DGKREYKPPG 
DGKREYKPPG 
DGKREYKPPG 
ELRKDYQGPG 
ELR 



251 

SLKKT K 

SLKKT K 

NLKKT K 

TLRRT K 

TLRRT K 

YVLKK N QNGRKYKPPG 

YCKKTR D GYRHTYKAPG 

I TNT 

ITNN 

CTVTT LSDGSV 

CSVTT QADGRV 

CTATS MPDGSV 

ITDS 

FVSS 

ANPWQLKSSA DYLAETKVRP LERRKYMLKS 
ANPWQLKSSA DYLAETKVRP LERRKYMLKS 
TNPWQLKSSA DYLAETKVRP LERRKYMLKS 
TNPWQLKS S A DYLAETKVRP LERRKYMLKS 



300 

TRKLHNILGV ETGGPGGRRA 
TRKLHNILGV ETGGPGGRRA 
TRKLHNILGV ETGGPGGRRA 
TRRLQEVLGV QTGGPGGRRA 



SRKLHDILGV 
ARKLHE I LGV 

KG 

RG 

VLNGGRSRRG 
VLVGGRS RRG 
VLAGARS KRG 



301 

GESGHTVADY 
GESGHTVADY 
GESGHTVADY 
GEPGHSPADY 

DEPGHSVSDY 
GEPGHSVEDY 
. LAHILHCQD 
. LTMVLKTYD 
WGNALKGCDD 
WGTALKGCDD 
WSTALKNMGD 
TFSQLLNTKQ 
SLHQVLQCKD 
R E ALA E HAD L 
REALAEHADL 
REALAEHADL 
REALAEHADL 



LKFKDLILRM 
LKFKDLILRM 
LKFKDLILRM 
LRFQDLVLRM 
. . . KDLVLRM 
LKFKDLILRM 
SKFKDLIKRM 
RYFIDFLQRC 
TSFLDFLRRC 
PLFLDFLKQC 
YLFIEFLKRC 
ELFVDFLKRC 
ASFLDFLSKC 
VSFLSFISDC 
KSMVELIKRM 
KSMVEL I KRM 
KSMVELIKRM 
KSMVELIKRM 



LDYDPKTRIQ 
LDYDPKTRIQ 
LDYDPKTRIQ 
LEYEPAARIS 
LEYEPAARIS 
LDFD PKTRVT 
LQFDPKQRIS 
LEWDPAERMT 
LVWE PSLRMT 
LEWDPAVRMT 
LHWDPSARLT 
LDWD PETRMT 
LKWDPKDRIT 
LKWDPDERMT 
LTWESHERIS 
LTWESHERIS 
LTWESHERIS 
LTWESHERIS 



LDQIETVNGG 
LDQIETVNGG 
LDQIETVNGG 
LDQIETVNGG 

340 

PYYALQHSFF 
PYYALQHSFF 
PYYALQHSFF 
PLGALQHGFF 
PLGALQHGFF 
PYYALQHNFF 
PYYWRHPFL 
PDEAAHHEFL 
PDQALKHAWI 
PGQALRHPWL 
PAQALRHPWI 
PAQALKHKWL 
VDSALQHEFI 
PQQAAQHDFL 
PSAALRHPFV 
PSAALRHPFV 
PSAALRHPFV 
PSAALRHPFV 



ETGGPGGRRL 
TSGGPGGRRL 
RKRSPGSKS . 
KKRYPDSKD . 
KLRGPPESRE 
KKRGPPGSKD 
KMRGPPASRS 
HNRLLVPSTR 
KGVSRRPFSK 
SVASRLTFPD 
SVASRLTFPD 
GAVSRLSFPD 
GAVNRLSFPD 



dmmnb ocf48776.kd.pep 
humdyrklc np 001 387. kd.pep 
musdyrkio np 031 9 i 6. kd.pep 
rctdyrklc np 036923. kd.pep 
humdyrklb np 004705. kd.pep 
musdyrklb np 034222. kd.pep 
wormi04clO q22 1 55. kd.pep 
humhipk4. kd.pep 
monkeyhipk4 ab074449.kd.fas.p 
mushipk4. kd.pep 
rathipk4 xp 21 8355. kd.pep 



A'. 



h: 



— kob7 schpo. kd.pep 
•pomt schpo. kd.pep 



-humdyrk2 np 006473.kd.pep 
— humdyrk3 np 003573. kd.pep 



wormf49e1t q20604. kd.pep 

-dmsmi35c q9v3d5. kd.pep 
• hurndyrk4 dyr4 human. kd.pep 
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